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CLAEVIS 



1. A method for quantitating the individual contribution of a mutation or combination 
of mutations to the drug resistance phenotype exhibited by HTV, said method 
comprising the step of performing a linear regression analysis using data from a 
dataset of matching genotypes and phenotypes, 

wherein the log fold resistance, pFR, of each HIV strain is modelled as the sum of 
aU the mdrvrdual resistance contributions for each of the mutations or combinations 
of mutations that occur in HTV according to the following equation; 
pFR = fl A M A + ^ B M B + p ^ + ... + p zMz +£ 

wherein each individual resistance contribution is calculated by multiplying a 

mutation factor, M A , M B Mz, for each mulation or combmation ofmutations 

by a resistance coefficient fij, fi B , fizi 

wherein for a combination ofmutations, the mutation factor M„ presents the co- 
occurrence of one mutation with other one or more mutations and the coefficient 
represents the synergy or antagonism between the one mutation with the other one 
or more mutations; 

wherein the mutation fector assigned to each mutation or combination ofmutations 
reflecte the degree to which that mutation or combination of mutations is present in 
the HTV strain and, if present, to which degree the mutation is present in a mixture; 
wherein each resistance coefficient reflects the contribution of the mutation or 
combmation of mutations to the fold resistance exhibited by the strain; 
and wherein the error term e, represents the difference between the modelled 
prediction and the experimentally determined measurement. 

2. A method according to claim 1, wherein correlations are removed from the dataset 
for correlated mutations where not all correlated mutations contribute to the drug 
resistance phenotype, using an algorithm to track the change in pFR for each 
mulation as the effecte of individual mutations or combinations ofmutations are 
removed from the dataset. 

A method according to claim 2, wherein the algorithm performs the following 
steps! "~" 

a) calculate average pFR for all mutations with a sufficient count in the 
database to be significant; 
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b) determine foe extremes (maximum, minimum), and select the mutation with 
the pFR furthest away from the global average; 

c) remove all virus strains that have the selected mutation from the dataset and 
reiterate from step a); 

d) stop when the selected mutation in step b) has an average pFR that 
approximates to the global average; 

such that removing virus strains with a certain resistance causing mutation 
results in an increase of the average pFR for correlating mutations, which thus 
have a higher average pFR. 



4. A method according to any one of claims 1-3, wherein the algorithm performs the 
following steps: 

a) calculate correlation coefficient between all mutations with a sufficient 
count in the database and the pFR; 

b) determine the extremes (maximum, minimum), and select the mutation with 
tiie highest absolute value of correlation coefficient; 

c) calculate a linear model for the pFR with the selected mutation(s) (from step 
b), all previous iterations); 

d) take the residue; 

e) calculate correlation coefficient between ah mutations wilh a sufficient 
count in the database and the residue; 

f) determine the extremes (maximum, minimum), and select the mutation with 
the highest absolute value of correlation coefficient; 

g) calculate a linear model for the pFR with the selected mutation(s) (from step 
f), all previous iterations); and 

h) reiterate from step d); 

i) stop when the selected mutation in step g) has a correlation coefficient that 
approximates to zero. 



5. 



A method according to any one of the preceding claims, wherein censored values i 
the genotype / phenotype database are replaced by a maximum likelihood 
estimation. 
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6. A method according to claim 5, wherein for each iteration of the linear regression, 
the following steps are performed until the predictions converge: 

a) Calculate a linear regression model without censored values; 

b) Use the phenotypic measured value V 0 as if the censor was " = ", e.g. when a 
5 result is expressed as -log FR < 4, we will treat Vo as -log FR =4; 

c) Look at the prediction P from the model and apply either: 
When case '<'-censor: 

■ P < V 0 - 0.798 a (center of gravity of half Gaussian distribution) 

o Remove value from training data for next iteration 
10 ■ V 0 -0.798o<P<V 0 

o Use V' = V 0 - 0.798o for next iteration 

■ V 0 <P 

o Use V centre of gravity of tail (<V) of a normal distribution N (P, 
o) as value for next iteration. 
15 When case *>* -censor: 

■ P > V 0 + 0.798 o (center of gravity of half Gaussian distribution) 

o Remove value from training data for next iteration 

■ V 0 + 0.798 a>P>V 0 

o Use V' = V 0 - 0.798 a for next iteration 
20 - V 0 >P 

o Use V centre of gravity of tail (>V) of a normal distribution N (P, 
a) as value for next iteration. 

d) Calculate a linear regression model and for the censored values in the linear 
regression model, either remove Ihe data-point from the training set, or use V 

25 instead of the censored phenotypes measurement, as described in step c); 

e) Re-iterate from step b) until prediction converges. 
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. A method of identifying a mutation that effects the degree of drug resistance 
exhibited by an HTV strain using a method according to any one of the preceding 
claims. 
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8. A method according to claim 1 wherein the contribution of a mutation pattern to the 
drug resistance phenotype exhibited by an HIV strain is calculated, said method 
comprising the steps of: 

a) obtaining a genetic sequence of said HTV strain, 

5 b) identifying the pattern of mutations in said genetic sequence, wherein said 

mutations are associated with resistance or susceptibility to drug therapy, and 
c) calculating the fold resistance of the HTV strain as compared to the wild type 
HTV strain by performing a linear regression analysis according to claim 1 . 

10 9. A method according to claim 8, which incorporates a method according to any one 
of claims 1-7. 
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10. A method according to any one of the preceding claims, wherein in the case of 
small datasets for particular mutations or combinations of mutations, the method is 
applied recursively to the set of virus strains that exhibit those particular mutations 
or combinations of mutations. 

1 1 . A diagnostic method for optimising a drug therapy in a patient, comprising 
performing a method according to any one claims 7-9 for each drug or combination 
of drugs being considered to obtaining a series of drug resistance phenolypes and 
therefore assess the effect of the plurality of drugs or drug combinations on the 
predicted fold resistance exhibited by the HTV strain with which the patient is 
infected and selecting the drug or drug combination for which the HTV strain is 
predicted to have the lowest fold resistance. 

12. A method according to any one of claims 7-9 and 1 1, wherein the resistance 
coefficient for each mutation is calculated using a method according to any one of 
claims 1-6. 

13. Use of a method according to any one of claims 1-6 for assessing the efficiency of a 
patient's therapy or for evaluating or optimizing a therapy. 



14. A diagnostic system for quantitating the individual contribution of a mutation or 
combination of mutations to the drug resistance phenotype exhibited by an HTV 
35 strain, said system comprising: 

a) means for obtaining a genetic sequence of said HTV strain; 
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b) means for identifying the mutation pattern in said genetic sequence as compared 
to wild type HIV; 

c) means for predicting the fold resistance exhibited by the HIV strain using any 
one of the methods of claims 1-12. 

5 

15. A computer apparatus or computer-based system adapted to perform the method of 
any one of the claims 1-12. 
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1 6. A computer program product for use in conjunction with a computer, said computer 
program comprising a computer readable storage medium and a computer program 
mechanism embedded therein, the computer program mechanism comprising a 
module that is configured so that upon receiving a request to quantify the individual 
contribution of a mutation or combination of mutations to the drug resistance 
phenotype exhibited by HTV, or to calculate the quantitative contribution of a 
mutation pattern to the drug resistance phenotype exhibited by an HTV strain, it 
performs a method according to any one of claims 1-12. 



